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1 .  INTRODUCTION 


If  the  wind  components  at  one  pressure  level  of  the  atmosphere  are  given 
and  the  vertical  wind  shear  is  known  between  the  given  level  and  another 
level,  the  winds  at  the  second  level  can  be  inferred.  Over  data-sparse 
regions  of  the  earth,  such  as  the  oceans,  winds  usually  are  known  at  only  a 
few  levels  because  the  data  are  limited  to  aircraft  reports,  cloud  motion 
vectors  derived  from  time-lapse  sequences  of  satellite  cloud  pictures,  or 
constant  pressure-level  balloons.  However,  if  in  addition  the  vertical  wind 
shear  were  known,  then  the  winds  could  be  determined  at  intermediate  levels 
as  well.  It  is  the  purpose  of  this  report  to  examine  from  a  theoretical 
point  of  view  the  possibility  of  deriving  the  vertical  shear  of  the  geostrophic 
wind  from  horizontal  gradients  of  satellite-measured  radiances. 

Work  of  this  kind  has  been  done  by  Zak  and  Panofsky  (1968)  and  by  Grody, 
et  al.  (1979),  but  in  both  papers  the  authors  used  radiances  from  only  one 
spectral  interval  (channel).  It  is  shown  in  this  report  that  the  use  of 
linear  combinations  of  radiances  from  many  channels  yields  greater  accuracy. 
Furthermore,  an  extensive  study  is  made  to  determine  the  best  pressure  inter¬ 
vals  to  use  for  the  Special  Sensor-H  (SSH)  sounder  of  the  Defense  Meteorologi¬ 
cal  Satellite  Program  (DMSP)  and  for  the  Tiros-N  Operational  Vertical  Sounder 
(TOVS)  instrument  package  on  the  TIROS-N/NOAA  series  of  operational  satellites. 
Once  the  optimum  pressure  interval  is  determined,  one  of  the  boundaries  of 
the  interval  becomes  the  pressure  level  at  which  the  winds  must  be  known  and 
the  other  boundary  becomes  the  pressure  level  at  which  the  winds  can  be 
inferred. 

2.  THEORY 

The  relationship  between  the  vertical  atmospheric  temperature  profile  as 
a  function  of  pressure  T(p)  and  the  corresponding  radiance  R^  emerging  from 
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the  top  of  the  atmosphere  and  measured  by  satellite  over  a  wavenumber  inter¬ 
val  (channel)  centered  at  V  is  given  by  the  radiative  transfer  equation 


f  dTv(p) 

RV  -  /  Bv  IT(P)J  "d£  1  dp- 


(1) 


In  Eq.  (1)  B^(T)  is  the  Planck  function  at  wavenumber  V  and  temperature  T, 

T^(p)  is  the  fractional  atmospheric  transmittance  between  the  pressure  level 

p  and  the  effective  top  of  the  atmosphere  p  ,  and  p  is  the  surface  pressure. 

1.  s 

Note  that  the  usual  boundary  term  in  Eq.  (1)  is  missing  because  only  those 
channels  that  do  not  sense  the  ground  radiation  are  used.  The  reason  for 
this  is  that  it  would  be  difficult  to  interpret  the  gradient  of  such 
radiances.  Furthermore,  the  geostrophic  wind  equation  is  not  valid  in  the 
boundary  layer  because  of  surface  frictional  forces. 

Another  relationship  that  will  be  needed  is  the  one  between  the  geo¬ 
strophic  wind  shear  and  the  horizontal  gradient  of  temperature  which  is 
given  by  the  thermal-wind  equation 

-+ 

* 1TK5T  ‘  V-  (2) 


where  V  is  the  geostrophic  wind  vector,  k  is  the  vertical  unit  vector, 
is  the  gradient  operator  on  the  surface  of  constant  pressure,  and  the  scalar 


a  »  f/R 

is  the  ratio  between  the  coriolis  parameter  f  and  the  gas  constant  for  air 
R. 

To  make  Eqs.  (1)  and  (2)  compatible,  one  can  choose  a  fixed  vertical 
temperature  profile  Tq  that  is  sufficiently  representative  of  the  area  to 
which  the  gradient  operator  is  being  applied  that  the  following  Taylor 
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which  is  one  of  the  fundamental  equations  we  will  need.  A  second  basic 
result  is  obtained  by  integrating  (8)  by  parts  to  obtain 


-  f*  -  dKv(t> 

VpRv  -  -ak  x  /  V(t)  -  dt,  (9) 

CT 

where  the  boundary  term  vanishes  because  K  (t)  vanishes  at  t  and  at  t  . 

V  S  A 

One  now  can  work  with  either  Eq.  (8)  or  (9). 

The  approach  of  Zak  and  Panofsky  (1968)  will  be  followed  first  by  con¬ 
sidering  Eq.  (9).  Since  the  kernel  function  K^(t)  somewhat  resembles  a 
normal  density  probability  function  in  that  it  has  a  single  peak  value  and 
falls  off  to  zero  on  each  side  of  its  peak  value,  its  derivative  has  a 
bimodal  shape.  An  example  of  such  a  curve  is  given  in  Fig.  1  by  the  solid 
curve  labeled  D^.  The  shape  of  curve  is  typical  of  the  function  dK^(t)/dt  in 
that  it  has  well  defined  positive  and  negative  modes  which  permit  Eq.  (9) 
to  be  written  in  the  form 


7  R 
P  v 


V(t) 


dK/t) 

dt 


dt 


V(t) 


dK  (t) 

v 

dt 


(10) 


where  tc  is  the  zero-crossing  level.  Since  the  differentiated  kernels  in  (10) 
are  both  positive,  the  second  mean-value  theorem  for  integrals  applies,  and 
one  can  write  Eq.  (10)  in  the  form 

ApRv  -  6k  x  [V(t1)  -  V(t2)]  =  6k  x  6V,  (11) 
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where 


B  -  [-aKv(tc)]"1  ,  (12) 

since  K^(tg)  «  Kv(t^,)  -  0.  All  that  is  known  about  the  arguments  t^  and  t2 

is  that  they  are  in  the  intervals  [t  ,  t  J  and  [t  ,  t  ),  respectively. 

X  c  c  s 

However,  since  the  differentiated  kernel  functions  have  well-defined  peaks 
in  each  of  these  intervals  (see  curve  in  Fig.  1),  reasonable  approxima¬ 
tions  for  t^  and  t2  are  the  pressure  levels  of  these  peaks. 

Since  6V  can  be  determined  from  Eq.  (11),  one  can  write  the  required 
geostrophic  wind  components  as  follows: 

3R 

u(t1)  =  u(t2)  +  <5u  -  u(t2>  +  B(t^-)  (13) 

3R 

v(t^  =  v(t2)  +  6v  =  v(t2)  -  BOjj^-)  ,  (14) 

where  the  wind  components  at  level  t^  are  inferred  from  those  given  at 
level  t2  and  from  the  components  of  the  radiance  gradient.  Eqs.  (13)  and 
(14)  are  the  results  obtained  by  Zak  and  Panofsky  (1963);  however  something 
more  can  be  done  to  improve  them. 

The  two  modes  of  curve  in  Fig.  1  are  so  broad  that  there  is  a  high 
probability  that  the  components  of  the  radiance  gradient  do  not  represent 
the  geostrophic  wind  component  differences  at  levels  t^  and  t2  as  determined 
by  the  peak  values.  Clearly,  what  is  required  is  that  the  two  modes  of  the 
bimodal  kernel  function  derivative  be  made  to  approach  Dirac  delta  functions 
(distributions)  as  clearly  as  possible.  A  method  for  achieving  this  is 
provided  by  a  variant  of  the  Backus-Gilbert  method  (Backus  and  Gilbert,  1970). 
(Also  see  Conrath,  1972.) 
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3.  MODIFIED  BACKUS-GILBERT  METHOD 


Since  radiances  are  measured  in  a  number  of  channels,  it  is  reasonable 
Co  use  all  of  those  radiances  Chat  do  not  have  a  component  of  ground  radia¬ 
tion  in  a  linear  combination  to  accomplish  the  objectives  described  in  the 
preceding  section.  Begin  by  replacing  the  continuous  variable  \>  in  Eq.  (9) 
by  the  index  i  to  signify  the  ith  channel,  that  is,  write  (9)  in  the  form 


V  R. 
P  i 


-ak  x 


U 

/ 


V(t) 


dK^t) 

“dt 


dt. 


i-1,... 


(15) 


If  one  now  multiplies  each  of  these  n  equations  by  a  yet-unspecified  coeffi¬ 
cient  c^  and  then  sums  the  resulting  n  equations,  the  following  linear  com¬ 
bination  is  obtained: 


n 


£ 


Ciri  =  x 


dK^t) 

“dt 


]  dt . 


(16) 


The  quantity  in  square  brackets  is  called  the  averaging  kernel  and  will  be 
denoted  by  D,  that  is. 


D(c, t) 


n 


£ 


dKi(t) 
Ci  “dt 


(17) 


One  is  free  to  choose  the  coefficients  c^  of  (16)  in  any  reasonable 
manner;  therefore,  they  should  be  chosen  so  as  to  give  the  averaging  kernel 
(17)  the  following  properties: 


1.  It  should  be  blmodal  with  a  positive  and  a  negative  lobe,  each  of 


which  approximates  a  Dirac  delta  function  as  closely  as  possible  at 
predesignated  pressure  levels  t^  and  t^. 

2.  The  two  lobes  should  each  have  unit  area,  but  the  areas  should  be 

of  opposite  sign,  and  the  crossover  point  should  be  at  tc  »  (tj+tj) /2. 
The  first  property  is  realized  by  minimizing  the  quadratic  form 

tg 

Q(c)  «  j  (t-t1)2(t-t2)2[D(c,t)-6(t-t1)+6(t-t2) ]2dt 


(t-t1)2(t-t2)2  D2(c,t)  dt 


(18) 


2  2 

with  respect  to  the  vector  c.  The  factors  (t-t^)  and  (t-t2)  in  (18)  are 
introduced  to  play  the  role  of  penalty  functions.  They  are  penalty  functions 
because  they  cause  Q  to  grow  rapidly  in  magnitude  if  the  squared  quantity  in 
brackets  deviates  substantially  from  zero  away  from  the  points  t^  and  t2« 
Thus,  Q  is  penalized  least  if  D  is  a  maximum  in  the  vicinity  of  t^  and  t2 
and  a  minimum  elsewhere.  The  second  property  listed  above  translates  into 
the  two  constraints 


and 


D(c,t)  dt 


t 

s 

D(c,t)  dt  *  1 


t 

c 


D(c,tc)  =  0, 


(19) 


(20) 
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with  t  ■  (t.+t9)/2. 
c  I  i 

We  now  wish  to  determine  the  vector  c  by  minimizing  the  quadratic  form 
(18),  subject  to  the  constraints  (19)  and  (20).  This  is  a  classical  prob¬ 
lem  that  is  readily  solved  by  introducing  a  pair  of  Lagrangian  multipliers 
X  and  y  and  combining  Eqs.  (18),  (19)  and  (20)  into  a  single  expression, 
defined  by  Q,  that  must  be  minimized,  where 

Q(c)  =  cTSc  +  X(cTa-l)  +  y(c^S).  (21) 


The  elements  of  the  matrix  S  and  the  vectors  a  and  b  are,  by  (17),  (18), 
(19)  and  (20),  respectively, 


t 


’ij 


,  „  dK  (t)  dK  (t) 

(t-tl)2(t-t2)2  dt. 


(22) 


dK±(t) 

“dt 


dt 


dK  (t) 

“dt -  dt  =  Ki(tc)  ’ 


(23) 


and 


dKi(tc> 


i  dt 


(24) 


for  i,j“l,...,n.  The  coefficient  vector  c  which  minimizes  expression  (21) 
satisfies  the  normal  equation 


or 


-  2 Sc  +  Xa  +  Mb  -  <$, 
dc 


c  -  -  j  S  1(Xa  +  yb) 


(25) 
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Multiplying  c  by  a  and  b,  one  satisfies  constraints  (19)  and  (20),  that 


is. 


-►T-+ 
c  a 


1,,-*T  ^ 
■^(Aa  + 


yb  ) 


S-1a 


1 


(26) 


and 


c  b 


|(AaT  +  ybT)  S_1b 


0. 


(27) 


If  one  sets 


1  -+T  -1-*- 

a  ■  -  y  a  S  a,  B 


I  tT.-l? 


1  iT.-l-*' 


-  |  aSb  -  -  y  bS 


1  -+T  -1-*- 
-  y  b  S  b, 


(28) 


then  (26)  and  (27)  reduce  to  the  pair  of  simultaneous  equations 


a\  -  By  -  1 

BA  +  yy  =  0, 


whose  solutions  are 

A  -  -y-* -  and  y  «  - ,  (29) 

B  -  ay  3  -  ay 

2 

provided  B  -  ay  +  0.  Substitution  of  (28)  into  (29),  and  in  turn  (29)  into 

(25) ,  yields  the  required  coefficient  vector  c. 

We  will  now  have  to  digress  somewhat  because  when  the  calculations  of 

Eqs.  (22)  through  (29)  were  performed  with  real  data  disaster  struck  quickly. 

The  difficulty  was  not  with  the  theory,  but  with  an  unexpected  practical 

matter,  namely,  the  system  of  equations  (26)  and  (27)  was  singular.  In 
2 

other  words,  B  -  ay  -  0  in  solution  (29),  or  nearly  so.  The  reason  this 
happened  was  that  for  most  of  the  cases  tried, the  zero  crossover  naturally 
fell  at  the  point  (t^+t2)/2,  without  having  to  stipulate  it  as  such  through 
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Eq.  (20).  Hence,  the  constraint  (20)  was  redundant  with  what  was  already 
an  accomplished  fact  and  the  system  became  singular.  To  rectify  the  situa¬ 
tion  it  is  necessary  to  delete  constraint  (20),  which  resulted  in  the 
following  changes.  (The  cases  in  which  the  system  was  not  singular  will 
be  discussed  later.) 

First  expression  (21)  takes  the  form 

Q(c)  -  cTS-1c  +  X(cTa-l),  (30) 

which  when  minimized  has  the  solution 


(31) 


(32) 


(33) 


which  is  the  required  coefficient  vector. 

If  the  coefficients  obtained  from  Eq.  (33)  are  now  used  in  Eq.  (17), 
an  explicit  expression  for  the  averaging  kernel  D  is  obtained.  An  example 
of  such  a  curve  is  plotted  as  the  dashed  curve  in  Fig.  1  for  which  n  =  4. 
The  curve  was  chosen  particularly  because  its  negative  lobe  best  coin¬ 
cides  with  the  negative  lobe  of  the  solid  curve  labeled  D^.  Recall  that 
D^,  which  was  described  previously.  Is  based  on  a  single  kernel  function. 
In  order  to  make  the  proper  geometric  comparison  between  curves  and  D2, 
they  each  had  to  be  normalized  by  dividing  the  original  functions  by  the 


maximum  of  their  absolute  values. 


Two  Important  facts  emerge  when  comparing  the  single  channel  approach, 
as  exemplified  by  curve  D^,  with  the  multichannel  approach,  as  illustrated 
by  curve  D2.  First,  both  lobes  of  D2  are  much  narrower  than  those  of  D^; 
hence,  the  mean-value  assumption  required  to  obtain  Eq.  (11)  is  more 
realizable  for  the  multichannel  case.  Second,  the  levels  at  which  the 
positive  and  negative  lobes  of  D2  peak  are  much  closer  together  than  they 
are  for  the  two  lobes  of  D^.  This  permits  the  wind  vector  difference 
V(t^)  -  V(t2)  in  (11)  to  be  calculated  more  accurately. 

When  Eq.  (16)  and  the  bimodal  averaging  kernel  (17)  are  used  in  place 
of  Eq.  (10),  one  obtains  the  new  result 


-ak  xi 


$(t)D(c,t)dt  -  f  V(t)  D(cTt)  dtl  , 


and  then  Eq.  (11)  takes  on  the  form 


^  c^r^  =  -ak  x  [V(t^)  -  V^)]®  -ak  x  6V. 


The  required  geostrophic  wind  components  now  have  the  more  accurate  form 


u<tl>  '  "(t2>  ‘a]£Ci 


v(tl)  '  v<t2>  Ci 


]  6 


As  before,  the  wind  components  at  level  t^  are  inferred  from  those  given  at 
level  t2  and  from  the  components  of  the  radiance  gradients,  but  now  there 
is  a  linear  combination  of  radiance  gradients  in  place  of  a  single  gradient 
component . 

4.  BEST  LAYERS  FOR  OPERATIONAL  SOUNDERS 

Of  course  the  purpose  in  deriving  Eqs.  (36)  and  (37)  is  to  apply  these 
results  to  actual  satellite  data.  More  is  said  on  this  subject  subsequently, 
but  in  this  section  we  wish  to  take  advantage  of  the  diagnostic  tools 
afforded  by  the  averaging  kernel  technique.  Since  the  levels  t^  and  t2  at 
which  the  bimodal  averaging  kernel  lobes  peak  are  specified  by  the  user, 
averaging  kernels  can  be  calculated  for  all  possible  combinations  of  pairs 
of  levels  out  of  a  given  finite  set  of  levels  (t^).  From  this  collection 
of  calculations  one  can  determine  the  layers  over  which  the  thermal  winds 
can  be  optimally  derived.  The  outcome  depends  on  the  particular  sounding 
instrument  being  used,  that  is,  on  the  shapes  and  numbers  of  kernel  functions 
that  are  available. 

Optimum  layers  were  determined  for  the  two  currently  operational  sounders, 
namely,  the  DMSP-SSH  sounder  and  the  NOAA-TOVS  sounder.  The  SSH  has  six 
channels  in  the  15-ym  CO2  absorption  band  that  can  be  used  for  temperature 
sounding,  but  since  two  of  these  channels  sense  the  ground,  only  4  channels 
can  be  used  to  derive  vertical  wind  shear.  They  are  centered  at  668.5,  677, 
695,  and  708  cm  On  the  other  hand,  the  TOVS  sounder  contains  three 
separate  instrument  packages:  (1)  the  High  Resolution  Infrared  Radiation 
Sounder  Number  2  (HIRS/2)  with  20  channels  primarily  in  the  infrared  region 
of  the  spectrum,  (2)  the  Microwave  Sounding  Unit  (MSU)  with  4  channels  in 
the  5.5-mm  oxygen  complex,  and  (3)  the  Stratospheric  Sounding  Unit  (SSU) 


with  3  channels  that  employ  a  selective  absorption  technique  in  the  15-um 
CO 2  band  to  sound  the  middle  stratosphere. 

Two  separate  tests  were  conducted.  The  first  test  used  only  the  four 
most  absorbing  SSH  channels  in  Eq.  (17)  to  obtain  the  bimodal  averaging 
kernels.  In  order  to  determine  the  optimum  pressure  interval  over  which 
to  calculate  the  wind  shear,  averaging  kernels  for  45  different  combina¬ 
tions  of  pressure  levels  t^  and  t2  were  computed.  There  was  no  need  to 
calculate  averaging  kernels  for  additional  combinations  of  pressure  levels 
because  the  results  from  neighboring  levels  changed  by  only  small  amounts 
and  in  a  predictable  manner.  A  search  was  made  of  the  45  calculated 
averaging  kernels  for  the  best  ones.  The  criterion  of  "best"  is  those 
that  have  the  narrowest  main  lobes  while  at  the  same  time  have  minimal 
side  lobes.  After  the  search  was  completed  two  cases  were  found  to  be 
superior  to  all  the  others  in  satisfying  the  criterion  of  "best".  The 
averaging  kernels  of  these  two  cases  are  shown  in  Fig.  2  and  are  labeled 
and  D^;  they  also  were  normalized  as  before  for  comparative  purposes. 

One  concludes  from  these  two  curves  that  the  best  pressure  intervals 
are  from  12  to  125  mb  and  from  80  to  400  mb. 

Expression  (6)  for  the  kernel  functions  indicates  a  temperature  depen¬ 
dence;  therefore,  kernel  functions  were  calculated  for  9  very  different 
U.S.  Standard  Atmosphere  Supplements  (1966).  However,  the  same  two  pairs 
of  layers  that  were  just  given  were  found  to  be  best  in  each  case. 
Although  the  averaging  kernels  for  a  fixed  pair  of  levels  t^  and  t2  differed 
in  shape  for  the  9  temperature  profiles  studied,  the  differences  were  only 
in  the  fine  structure  -  the  basic  shape  remained  the  same.  (The  averaging 
kernels  shown  in  Fig.  2  are  for  the  45°N-July  Supplementary  Atmosphere.) 

Of  course,  the  reason  for  this  is  that  the  basic  shape  of  the  kernel  func¬ 
tion  given  by  (6)  is  dictated  by  the  weighting  function  dT^(t)/dt. 
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The  second  test  used  a  total  of  11  channels  from  the  TOVS  Instrument 
in  Eq.  (17)  to  obtain  the  bimodal  averaging  kernels.  None  of  the  MSU 
channels  were  used  in  this  test  because  the  field  of  view  is  much  larger 
than  that  of  the  HIRS/2  instrument  and,  therefore,  could  affect  the 
accuracy  of  the  horizontal  radiance  gradients,  relative  to  those  for  the 
other  instruments.  The  SSU  also  has  a  larger  field  of  view  than  that  of 
the  HIRS/2  instrument,  but  because  the  stratosphere  is  believed  to  be  more 
horizontally  homogeneous  than  the  troposphere  is,  the  three  SSU  channels 
were  used  along  with  8  of  the  most  opaque  of  the  HIRS/2  channels.  The 
HIRS/2  channels  used  were  centered  at  668.5,  680,  690,  703,  716,  2240, 

2270,  and  2360  cm-1. 

The  same  45  pairs  of  levels  t^  and  t2  as  were  used  before  were  used 
in  calculating  the  bimodal  averaging  kernels  for  the  11  TOVS  channels. 

This  time  the  search  for  best  levels  lead  to  three  cases  which  are  shown 
in  Fig.  3  as  curves  D^,  and  D^,  which  were  normalized  as  before  for 
comparative  purposes.  These  3  curves  are  more  "wiggly"  than  those  shown 
in  Fig.  2  for  two  reasons:  (1)  they  are  based  on  a  midlatitudinal  tempera¬ 
ture  profile  that  had  more  fine  structure  than  the  supplemental  temperature 
profile  used  for  Fig.  2,  and  (2)  as  the  number  of  kernel  functions  used  in 
the  linear  combination  (17)  is  increased,  more  structure  is  possible  in 
the  resulting  averaging  kernel.  Calculations  of  averaging  kernels  were  also 
made  for  a  tropical  and  a  polar  temperature  profile  for  the  11  channels,  but 
again  the  averaging  kernels  were  sufficiently  similar  to  those  plotted  in 
Fig.  3  that  the  conclusions  about  the  best  levels  remain  the  same. 

The  conclusions  to  be  drawn  from  Fig.  3  are  that  the  best  pressure 
intervals  for  the  TOVS  sounder  (minus  the  microwave  channels)  are  from  25 
to  150  mb,  from  80  to  325  mb,  and  from  250  to  625  mb.  As  would  be  expected. 


the  main  lobes  are  narrower  for  the  11-channel  averaging  kernels  of  Fig.  3 
than  they  are  for  the  4-channel  kernels  of  Fig.  2.  Averaging  kernels  were 
also  calculated  for  only  the  8  HIRS/2  channels.  The  same  three  pressure 
intervals  were  optimal,  but  the  narrowness  of  the  main  lobes  and  size  of 
the  side  lobes  were  degraded  in  a  predictable  way,  that  is,  curve  was 
degraded  the  most,  curve  was  affected  only  slightly,  and  curve  was 
virtually  unaffected. 

There  is  a  caveat  in  what  has  been  discussed  in  this  section  in  that 
there  should  be  one  more  best  result  in  Fig.  2  and  two  more  in  Fig.  3. 

They  were  lost  when  Eqs.  (21),  (25),  (26),  (27),  and  (29)  were  replaced  by 
Eqs.  (30,  (31),  (32),  and  (33).  As  stated  earlier,  these  replacements  were 
made  because  in  the  vast  majority  of  cases  the  system  of  equations  (26)  and 
(27)  became  singular.  However,  in  the  cases  for  which  this  was  not  true, 
valid  results  were  obtained.  However,  at  the  time  the  singular  matrices 
were  found  the  computer  program  was  thought  to  be  faulty  and  the  printout 
was  thrown  away.  In  retrospect  it  is  clear  that  there  were  cases  where  the 
zero  crossover  did  not  occur  at  the  level  (tj+t2)/2.  In  these  cases  the 
calculated  results  with  the  two  constraints  left  intact  were  valid.  However, 
rather  than  advocate  an  approach  where  two  different  programs  must  be  run 
to  get  one  set  of  results,  the  author  would  like  to  present  an  alternative, 
but  mathematically  equivalent,  single  approach  that  yields  the  complete  set 
of  optimal  averaging  kernels. 


5.  AN  ALTERNATIVE  APPROACH 

Recall  that  in  Section  2  it  was  stated  that  one  can  work  with  either 
Eq.  (8)  or  Eq.  (9),  and  that  Zak  and  Panofsky  (1968)  used  Eq.  (9).  In  the 
alternative  approach  developed  in  this  section  we  begin  with  Eq.  (8),  that  is, 
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(38) 


The  intent  here  is  again  to  use  a  modified  Backus-Gilbert  method  to  produce 
optimal  averaging  kernels,  but  because  the  derived  quantity  in  (38)  is 
vertical  wind  shear  instead  of  the  geostrophic  wind  vector,  different 
optimization  criteria  are  required. 

Following  the  development  in  Section  3,  one  multiplies  each  of  the  n 
equations  in  (38)  by  an  undetermined  coefficient  c^  and  then  sums  the  result¬ 
ing  n  equations  to  obtain  the  single  equation: 
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dt. 


(39) 


The  quantity  in  square  brackets,  though  different  from  that  of  Eq.  (16), is 
again  an  averaging  kernel  which  will  be  denoted  by  R,  that  is, 


n 

R(c,t)  =  ciK1(t)  . 
i=l 


(40) 


This  time  the  choice  of  coefficients  has  a  different  motivation.  We  are 
interested  in  determining  the  average  wind  shear  between  levels  t^  and  t2» 
that  is, 
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Therefore,  if  in  Eq.  (39)  the  averaging  kernel  were  a  rectangular  unit  func¬ 
tion  defined  by 


(1  for  t.  <  t  <  t»  \ 
0  otherwise  J 


then  the  mean  value  theorem  applied  to  Eq.  (39)  would  yield  the  average 
wind  shear  defined  by  (41) . 

As  a  consequence  of  the  foregoing  the  averaging  kernel  (40)  should  have 
the  following  properties: 

1.  It  should  approximate  the  rectangular  unit  function  p,  as  defined 
by  (42),  as  closely  as  possible  for  predesignated  pressure  levels 


t^  and  t 2* 

2.  The  nonzero  portion  of  the  averaging  kernel  (i.e.,  between  t^  and  t,,) 
should  have  a  uniform  height  of  unity. 


The  first  property  is  realized  by  minimizing  the  quadratic  form 


with  respect  to  the  vector  c.  The  factor  [ 1— p ] ^  is  a  penalty  function.  The 
second  line  of  Eq.  (43)  says  that  the  portion  of  the  averaging  kernel  outside 
the  rectangle  should  have  a  minimum  value.  The  second  property  listed  above 
cannot  be  put  into  a  form  that  can  be  manipulated  analytically.  However, 
a  partial  solution  to  this  difficulty  is  provided  by  the  constraint  that  the 
averaging  kernel  between  t^  and  t ^  have  an  average  height  of  unity,  that  is, 
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that 


1  . 


(44) 


The  determination  of  the  vector  c  by  minimizing  the  quadratic  form  (43) , 
subject  to  the  constraint  (44),  is  again  the  problem  given  by  Eqs.  (30), 
(31),  and  (32),  whose  solution  is  given  by 


which  is  Eq.  (33),  except  that  the  elements  of  the  matrix  S  and  the  vector 
a  are  different.  In  this  case,  by  Eqs.  (40,  (43),  and  (44),  the  respective 
elements  are  given  by: 


(46) 


(47) 


for  i,j*l,...,n.  The  vector  c  given  by  (45)  is  the  required  coefficient 
vector,  which  when  applied  to  Eq.  (40)  provides  the  best  approximation  to 
a  rectangular  unit  function  by  the  averaging  kernel. 

When  (40)  is  applied  to  (39)  and  the  second  mean-value  theorem  for 
integrals  is  applied,  one  has 
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£  cA =  °* x  <If>  /*  R(?>t 

i*i  r 


)dt 


=  ak  X  <-^Y>  (tj-t^)  . 


(48) 


But  by  the  law  of  the  mean  for  derivatives 


v(t2)  -  v(ti}  _ 

t2  -  ci  =  "at" 


(49) 


so  that  the  required  geostrophic  wind  components  by  (48)  have  the  form 


14  3R 

u(tl)  =  u(t2)  +  <|f>  (t2-tl)  =  u(t2)  +  Ci  (g^)p, 


(50) 


i=l 


and 


3R 

v(tx)  =  v(t2)  +  <|^>  (t2-tx)  =  v(t2)  -  c±  (^i)p.  (51) 


i=l 


The  interpretation  of  Eqs.  (50)  and  (51)  is  (as  before)  that  the  wind  com¬ 
ponents  at  level  t^  are  inferred  from  those  given  at  level  t2  plus  the 
linear  combination  of  components  of  the  radiance  gradients.  Note  that, 
aside  from  a  difference  in  sign,  Eqs.  (50)  and  (51)  are  identical  to  Eqs. 
(36)  and  (37),  as  they  should  be  since  the  starting  points,  namely  Fqs.  (8) 
and  (9),  are  just  different  forms  of  the  same  equation.  However,  this 
numerical  exercise  has  produced  a  large  gain  in  that  all  optimal  pressure 
intervals  are  found  at  one  time  as  the  solution  to  the  problem  given  by 
Eqs.  (45),  (46),  and  (47). 
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6.  RESULTS 


The  two  tests  described  in  Section  4  were  also  applied  to  the  rectangu¬ 
lar  averaging  kernels  of  Eq.  (40).  All  details  and  conclusions  of  the 
tests  in  Section  4  remain  the  same,  including  the  kinds  and  numbers  of 
channels  used,  the  temperature  dependence  of  the  kernel  functions,  and 
the  combinations  of  pressure  levels  that  were  computed. 

In  the  first  test  three  averaging  kernels  (one  more  than  in  Section  4) 
were  found  to  satisfy  the  criterion  of  "best",  and  are  shown  in  Fig.  4 
with  the  labels  R^,  R2,  and  Rg .  They  do  not  have  "rectangular"  shapes 
because  the  kernel  functions  used  in  the  linear  combination  (40)  are  all 
rounded  in  the  manner  of  the  curves  shown  in  Fig.  4.  The  three  curves  in 
Fig.  4  indicate  that  the  best  pressure  intervals  for  the  DMSP 

sounder  are:  10  to  125  mb,  80-400  mb,  and  300-750  mb.  For  the  second  test 
the  best  rectangular  averaging  kernels  for  the  TOVS  sounders  are  shown  in 
Fig.  5.  These  five  curves  (two  more  than  in  Section  4)  yield  best  pressure 
intervals  of:  35-150,  115-330,  175-500,  285-625,  and  400-875  mb. 

A  comparison  of  the  unimodal  curves  in  Fig.  4  with  those  in  Fig.  5 
clearly  shows  the  lobes  of  Fig.  5  to  be  narrower  than  those  of  Fig.  4,  with 
the  important  consequence  that  the  sides  of  the  lobes  in  Fig.  5  drop  more 
vertically  than  they  do  in  Fig.  4.  Again,  this  is  to  be  expected  because 
the  additional  channels  that  are  used  to  produce  the  averaging  kernels  in 
Fig.  5  give  more  flexibility  to  the  shape  of  the  averaging  kernels.  Also, 
when  8  HIRS/2  channels  were  used  in  place  of  the  original  11  channels,  the 
conclusions  are  as  before,  that  is,  curve  R^  is  most  strongly  effected, 
curve  R^  is  effected  only  slightly,  and  curves  Rg,  R^ ,  and  Rg  are  virtually 
unaffected . 
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The  conclusions  of  Sections  4  and  6  as  to  which  pressure  intervals  are 


best  are  conveniently  summarized  in  Table  1.  The  additional  channels  on 
the  TIROS-N  satellite  provided  more  layer  options  than  were  available  from 

Table  1.  Best  layers  over  which  to  calculate  the  average 
vertical  wind  shear,  based  on  the  SSH  and  TOVS 
sounders,  for  the  bimodal  and  unimodal  averaging 
kernels . 


Bimodal  Averaging 
Kernel 

Unimodal  Averaging 
Kernel 

SSH 

12-125  mb 

10-125  mb 

Sounder 

80-400  mb 

80-400  mb 

* 

300-750  mb 

25-150  mb 

35-150  mb 

TOVS 

80-325  mb 

115-330  mb 

Sounder 

* 

175-500  mb 

250-625  mb 

285-625  mb 

* 

400-875  mb 

Missing  because  only  the  results  from  the  single— constraint 
problem  are  available  (see  the  dicussion  at  the  end  of 
Section  4) . 


the  DMSP  satellite.  Note  the  similarity  between  adjacent  entries  in  the 
two  columns.  The  slight  differences  in  pressure  intervals  can  be  attri¬ 
buted  to  differences  in  the  numerical  procedures  used  to  calculate  the 
averaging  kernels. 


7.  METHODS  FOR  DERIVING  WINDS 

Now  that  the  diagnostic  studies  have  been  completed,  we  can  discuss 
the  main  issue,  namely,  how  can  one  derive  winds  at  one  pressure  level, 
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given  the  winds  at  a  second  level  plus  the  gradients  of  the  radiances.  One 
method  has  been  discussed  in  Sections  3  and  5,  but  other  methods  are  also 
possible.  Brief  discussions  of  three  distinct  methods  are  now  presented. 

A.  Averaging  Kernels 

The  most  obvious  method  to  use  is  the  one  described  in  Sections  3 
and  5.  In  short,  one  prescribes  pressure  levels  t^  and  t£  and  then  uses 
Eqs.  (36)  and  (37),  or  equivalently,  Eqs.  (50)  and  (51).  However,  when 
these  equations  are  applied  to  real  data,  the  noise  in  the  measured  radiances 
must  be  taken  into  account.  A  solution  to  this  problem  is  given  by  Backus 
and  Gilbert  (1970).  Briefly,  one  must  either  derive  a  covariance  matrix  E 
for  the  measurement  error,  or  model  one  from  certain  assumptions  about  the 
behavior  of  the  noise.  Then  the  variance  of  the  solution  due  to  the  random 
errors  of  measurement  is  of  the  form 

a2(c)  =  cT  Ec.  (52) 

Now  instead  of  minimizing  quadratic  form  (18)  or  (43) ,  one  must 
minimize  the  linear  combination  of  quadratic  forms 

qQ(c)  +  (1-q)  a2(c) ,  (53) 

where  0  <  q  <  1.  Expression  (53)  must  also  be  subject  to  the  proper  con¬ 
straints,  which  can  be  (19)  and  (20);  or  just  (19);  or  just  (44).  The 
choice  of  the  value  for  the  scalar  q  depends  upon  the  tradeoff  arrangement 
one  wants  between  minimizing  noise  and  the  accuracy  of  the  shape  of  the 
averaging  kernel.  The  tradeoff  decision  depends  in  turn  upon  the  particular 
application  at  hand.  The  best  pressure  levels  to  apply  to  this  method  are 
listed  in  Table  1. 

This  method  should  be  used  with  real  data  only  if  the  following 
two  assumptions  apply: 
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1.  One  requires  only  geostrophic  winds,  not  the  true  winds,  and 

2.  The  transmittance  functions  of  the  radiative  transfer  equation 


are  known  with  reasonable  accuracy. 

B.  Vertical  Profile  Retrievals 

One  really  need  not  go  any  further  than  Eq.  (8)  or  (9).  Since  both 

of  these  equations  require  quantities  that  are  known  and  available,  namely, 

well-defined  kernel  functions  and  n  vectors  of  7  R. ,  one  can  retrieve 

P  i 

vertical  profiles  of  the  wind  shear  from  Eq.  (8)  or  vertical  profiles  of 
the  geostrophic  wind  vector  from  Eq.  (9).  These  kinds  of  retrieval  methods 
are  described  in  a  paper  by  Fleming  and  Smith  (1967),  and  are  applicable 
because  Eqs.  (8)  and  (9)  are  linear  Fredholm  integral  equations  of  the 
first  kind. 

When  Eq.  (9)  is  used, the  wind  profile  is  available  directly,  but 

since  it  will  be  accurate  in  only  a  relative  sense,  one  should  take  the 
— ► 

difference  <$V  between  the  wind  vector  profile  at  levels  t^  and  t ^  and  apply 

this  difference  to  the  first  parts  of  Eqs.  (13)  and  (14),  provided  the  wind 

components  u(t£)  and  v(t2>  are  known. 

If  Eq.  (8)  is  used,  then  the  retrieved  vertical  wind  shear  profile 

-*■ 

3V 

is  applied  to  Eq.  (41)  to  determine  the  average  wind  shear  <-^>  between 
levels  t^  and  t2*  This  average  value  is  then  applied  to  the  first  parts  of 
Eqs.  (50)  and  (51),  provided  the  wind  components  u(t2)  and  v(tj)  are  known. 
Again,  the  best  levels  at  which  to  apply  this  method  are  listed  in  Table  1. 
The  same  two  assumptions  that  were  listed  under  Method  A  above  apply  here 
as  well,  except  that  since  this  is  an  inverse  method,  the  transmittance 
functions  of  the  radiative  transfer  equation  must  be  known  with  even 
greater  accuracy  than  that  required  for  Method  A. 
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C.  Regression 


When  true  winds  are  required  (as  opposed  to  thermal  winds) ,  one  of 
the  most  appropriate  methods  is  ordinary  regression  in  which  the  radiance 
gradients  are  used  as  the  predictors.  To  carry  out  this  procedure,  one 
needs  to  compile  a  large  sample  of  true  wind  data  which  are  coincident  in 
time  and  place  with  satellite  radiance  measurements  from  which  horizontal 
gradients  can  be  calculated.  The  necessary  regression  coefficient  matrix 
can  then  be  computed  from  matrices  of  these  coincident  measurements.  This 
method  is  standard,  and  so  details  will  not  be  given  here.  One  can  find 
specifics  in  the  paper  by  Fleming  and  Smith  (1967).  The  best  layers  given 
in  Table  1  apply  to  the  regression  method  as  well  because  those  layers  are 
the  very  ones  over  which  the  radiance  gradients  are  the  most  sensitive  as 
predictors . 

8 .  CONCLUSIONS 

The  final  shapes  of  the  averaging  kernels  in  Figs.  2  through  5  are 
not  ideal.  One  reason  for  this  is  that  there  are  too  few  kernel  functions 
to  work  with,  but  a  far  more  important  reason  is  that  one  cannot  dictate 
the  shapes  of  the  individual  kernel  functions  because  they  are  prescribed 
by  the  physical  properties  of  the  atmosphere.  Consequently,  they  are  not 
even  linearly  independent  (that  is,  relative  to  the  noise  level  of  the 
measurements),  and,  therefore,  are  not  very  effective  basis  functions. 

The  use  of  the  words  "optimum"  and  "best"  in  this  report  needs  clarifi¬ 
cation.  For  a  given  set  of  pressure  levels  t^  and  t^  the  corresponding 
averaging  kernel  is  optimum  in  that  it  minimizes  the  defining  quadratic 
form.  Thus,  there  is  one  optimum  averaging  kernel  for  each  pair  of  given 
pressure  levels.  But  then  out  of  all  of  these  optimum  averaging  kernels 
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the  best  ones  were  chosen  to  be  the  ones  which  had  the  truest  main  lobe 
shape  and  the  smallest  side  lobe  amplitude. 

Even  though  several  methods  for  calculating  winds  were  presented,  the 
regression  approach  to  obtaining  true  winds  should  be  the  most  useful 
approach.  In  pursuing  this  approach  several  recommendations  can  be  made 
that  result  directly  from  this  study: 

1.  As  many  channels  as  possible  should  be  used  as  predictors,  even 
though  a  particular  kernel  function  may  contribute  only  weakly  to 
the  level  of  the  atmosphere  in  question. 

2.  Winds  cannot  be  inferred  with  equal  accuracy  at  all  levels  of  the 
atmosphere.  Since  the  preliminary  work  of  deciding  which  pressure 
levels  yield  the  most  accurate  winds  has  been  done  in  this  report 
(when  radiance  gradients  are  used  as  predictors),  one  need  only 
refer  to  Table  1  to  decide  on  the  pressure  levels  at  which  to 
collect  wind  data. 

3.  The  diagnostic  tools  developed  in  this  report  are  useful  in  help¬ 
ing  to  answer  questions  about  the  sensitivity  of  the  solution  to 
a  given  channel  or  about  the  results  that  can  be  expected  if  an 
instrument  other  than  the  SSH  or  TOVS  sounders  is  used  (e.g.,  a 
microwave  sounder) . 
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Figure  1.  The  normalized  derivative  of  a  single  kernel  function  (solid  curve)  versus 

the  normalized  averaging  kernel  of  four  kernel  function  derivatives  (dashed  curve) 
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TOVS  channels . 
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Figure  5.  The  five  best  unimodal  aver 
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